A Multi-Strip Algorithm and Its Application to Gene Characterization Using DNA-Array Data

نویسندگان

  • Gilad Lerman
  • Joseph McQuown
  • Bud Mishra
چکیده

A fast adaptive multiscale algorithm has been devised to characterize a random set of points spanning a high dimensional Euclidean space, but concentrated around special lower dimensional subsets. It has been adapted to analyze gene expression data from microarray experiments. We present here the simplest version of this “multi-strip” algorithm applied to a set of points in R concentrated around a line. The algorithm characterizes this set by finding a strip around the principal axis of the set, so that it isolates deviating points from the main bulk of points enveloped by the strip. The algorithm generalizes to computing a strip around a best L d-plane, where 1 ≤ d < D, or even fitting a strip around a d-dimensional Lipschitz graph. We establish various estimates for its performance. When applied to gene-expression data, the algorithm can be thought of as estimating the local statistics (means, standard deviations, tail distributions, etc.) as a function of the entire expression range. Genes with abnormal differential expression values can be identified and given biological interpretations based on the local deviations in their statistics. By avoiding rigid local segmentations (as in segmental nearest neighbor normalization) or nonadaptive global estimates, the algorithm achieves a superior performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A MULTI-OBJECTIVE EVOLUTIONARY ALGORITHM USING DECOMPOSITION (MOEA/D) AND ITS APPLICATION IN MULTIPURPOSE MULTI-RESERVOIR OPERATIONS

This paper presents a Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D) for the optimal operation of a complex multipurpose and multi-reservoir system. Firstly, MOEA/D decomposes a multi-objective optimization problem into a number of scalar optimization sub-problems and optimizes them simultaneously. It uses information of its several neighboring sub-problems for optimizin...

متن کامل

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...

متن کامل

A full ranking method using integrated DEA models and its application to modify GA for finding Pareto optimal solution of MOP problem

This paper uses integrated Data Envelopment Analysis (DEA) models to rank all extreme and non-extreme efficient Decision Making Units (DMUs) and then applies integrated DEA ranking method as a criterion to modify Genetic Algorithm (GA) for finding Pareto optimal solutions of a Multi Objective Programming (MOP) problem. The researchers have used ranking method as a shortcut way to modify GA to d...

متن کامل

Circularly Polarized Circular Slot Antenna Array Using Sequentially Rotated Feed Network

This paper presents the design, simulation, and measurement of two low-cost broadband circularly polarized (CP) printed antennas: a single element and an array at C band. The proposed single element antenna is excited by an L-shaped strip with a tapered end, located along the circular-slot diagonal line in the back plane. From the array experimental results, the 3 dB axial ratio bandwidth can r...

متن کامل

Mitochondrial DNA characterization of Sergentomyia sintoni populations and finding mammalian Leishmania infections in this sandfly by using ITS-rDNA gene

Sergentomyia sintoni is the natural vector of Sauroleishmania species of lizards. This sandfly isabundance in and around the burrows of great gerbils. S. sintoni was collected from peridomestic animalshelters, inside and around houses and also from the nearby burrows of the gerbil reservoir hosts,Rhombomys opimus, in several provinces of Iran. Mitochondrial Cytochrome b (Cyt b) of sandflies, wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003